Skip to content

Conversation

@kimminsu38oo
Copy link
Contributor

This PR fixes a logging issue in OpenCL profiling where the local_size was printed incorrectly.

Previously, the profiling output printed local_size[2] twice and missed local_size[1]. This resulted in wrong log entries like:

64x1x1  → 64x1x1  → 64x1x1  → 2x1x1  → ...

This fix ensures correct output of all three dimensions:
local_size[0] x local_size[1] x local_size[2]

@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Apr 10, 2025
@lhez
Copy link
Collaborator

lhez commented Apr 16, 2025

@max-krasnyansky I think we can merge this.

@max-krasnyansky max-krasnyansky merged commit 12b1750 into ggml-org:master Apr 16, 2025
51 checks passed
@kimminsu38oo kimminsu38oo deleted the fix/ggml-opencl-profiling-log branch April 18, 2025 10:36
colout pushed a commit to colout/llama.cpp that referenced this pull request Apr 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants